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ABSTRACT 

In this paper we present an approach to the integrated design problem for actively controlled large, 
flexible mechanical systems for which Control Structure Interaction (CSI) problems are of concern. 

The two coupled design problems have been identified as the optimal Structural Design problem and 
the optimal Controller Design problem. These two problems can be addressed within a decision 
making loop that would consider each seperately, and then sequentially analyze the effects of one on 
the other. Embedded in such a loop would be the simulation and coordination tasks as part of the 
decision tools required in a total (software) package. 

All of the above fire compute-intensive tasks. In any such task, possible decompositions and gains 
due to the inherent parallelism have to be exploited. We claim that the problems under consideration, 
as applied to large flexible mechanical structures are particularly suited to be mapped onto multi- 
computer systems in a hypercube topology. 


1. INTRODUCTION 


Issues related to accomplishing integrated design for structural and control systems are of increasing 
concern in the context of Large Space Structures. Indeed a number of attempts have been made to 
come up with unified cost criteria and optimization approaches ([1], (2)). It is evident that one of 
the major hurdles in all such attempts is, said is going to be, computational. For truly large scale 
systems all four aspects, namely finite element modeling, control algorithms, over-all optimization 
and finally total closed-loop simulation singly or jointly create computational problems. 

The use of distributed memory multiple processors connected in a hypercube topology has proven to 
be very useful in many large computation intensive tasks, especially with favorable parameter struc- 
tures and algorithms which are compatible with the said topology. Indeed, the above four problems 
have been addressed individually (possibly in different contexts) for hypercube architectures. For 
example in [3], as a result of finite element discretization, linear equations of banded form are ob- 
tained and solved with an approach based on the Conjugate Gradient method. Experimental results 
on a 16-node Intel 386-based iPSC/2 hypercube have shown an almost linear speedup over a single 
processor implementation. 

Some control design related work, specifically on solutions of quadratic regulator problems have also 
been reported in the literature [4] on hypercube based solution approaches. Yet these approaches have 
not been evaluated within the context of a total design package for Large Space Structures. In this 
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paper we shall report on such an evaluation and introduce preliminary results for the development of 
a a CSI design package which assumes a large system with subsystems under decentralized control. 
The decentralized control design approach is based on the package DOLORES [5] which is being 
modified for the hypercuber 

Distributed memory multiprocessors interconnected in a static topology such as a mesh, a toroid or 
a hypercube have been proposed as architectures particularly suitable for diverse application areas of 
scientific computing. However, in order to use these general purpose parallel computers in a specific 
application, existing algorithms need to be restructured for the architecture and new algorithms 
developed. In fact, conventional algorithms need to be reexamined, since the best algorithm for 
a sequential computer may not be the best for a parallel computer. Parallelization schemes for 
most applications on distributed memory multiprocessors are characterized by the mapping of a 
physical domain or its graph representation to processors with locality of communication. However, 
applications exhibit different characteristics in data dependencies and interprocessor communication 
patterns and volume. Finite element and matrix problems have very regular structures and the 
volume of communication and the amount of computation can be predicted. 

Therefore, the basic model we shall consider is the banded matrix structure obtained from a finite- 
element model. We propose the retention of the nodal form of such models as we move from modeling 
to controller design and back. We intend to fix the number of nodes and nodal variables through the 
optimal design cycle. We will argue for the possible insertion of dummy nodes to retain as broad 
a design space as necessary for doing parametric designs where comparative evaluations of multiple 
criteria will becomeneeded. 

The key reason for keeping the nodal form, possibly with excessive nodes, is doing the mapping to 
the processors only once. Thus, the required communication structure for the various problems will 
remain the same for different iterations of the same problem. 

The configuration of a design package which includes the FEM stage is given in Figure 1. 


2. THE FINITE ELEMENT MODEL 


2.1 The Problems Considered 

The idea behind the finite element method has always been to provide a formulation which can 
exploit the resources of digital computers. The resources provided by multiprocessors, particularly 
hypercube topologies, are especially useful for finite element based analysis and design problems 
where the banded structure of relevant matrices are exploited [6]. The important stages in such an 
analysis are: 

• Obtaining the equations of motion of a structure by deriving the element equations and then 
assembling the equations for all elements. 

• Solving the FEM equations, i.e. essentially solving a set of linear equations corresponding to 
static force-displacement type relations. 
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Optimal Structural Design 



init. cond. 
disturbances 
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Figure 1: The configuration for a design package. 
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• Modal analysis , i.c. a dynamic analysis requiring solution for modes and mode-shapes. 

Thus the problems of concern are solution of equations of the form, 

Kq — BF (2.1) 

for q and 

(jC - u> 7 M^ q = 0 (2.2) 

for the set The nxn M. and K matrices are the mass and stiffness matrices respectively, and 

we assume that the consistent mass matrix approach has been utilized in generating Af. Thus the 

interdepen dancies implied by the elements of the two matrices (location of zeros, etc) are the same. 
The vector q is the nodal displacement vector, and F is the vector of applied forces to the structure 
and the matrix B denotes the influences on individual nodes. 


2.2 Mapping to Processors and K Generation 

The key to having a good total design package is having properties in parameter sets that different 
portions of the package, addressing different problems, can jointly exploit. The basic property we 
want to exploit here is the banded form arising from the FEM, Thus the important first stage is 
obtaining that form and mapping it onto the processors [7]. This is coupled to the so-called K Gen- 
eration operation. The elemental stiffness equations give relationships between each of the nodes 
associated with the element. The global K matrix is the superposition of the individual elemental K 
matrices. Therefore, the individual elemental K matrices may be computed independently and t|u$ 
portion of the generation of K can be done completely in parallel. It is important to note that the 
adjacency matrix associated with the finite element graph is identical in its zero-nonzero structure 
to the K matrix. This knowledge is utilized in a number of subsequent iterative solution schemes. 

A row partitioning of the global K matrix corresponds to mapping a set of nodes onto each processor 
of a hypercube. The mapping determines which rows of K need be resident on a processor in order 
to perform the necessary matrix-vector product operations in any algorithm under consideration. 
The operations to be performed in parallel are then obtained from these rows. 

The set of all elements which contain nodes in the mapped node set for a processor is the associated 
element set for that processor. If the nodes are contained entirely in the mapped set, the elements are 
interior elements . The difference between the associated and interior sets is called the boundary ele- 
ment set. This set provides information for the construction of rows mapped to at least two different 
processors and thus represents data that must be either communicated or calculated redundantly. 1 

Since the volume of data to be transmitted is proportional to the size of the boundary element set, 
it is important to perform the selection of this set carefully. An example of mapping is shown in 
Figure 2. 

1 The so-called Component Mode Synthesis techniques promise to provide a rich avenue of further research coupling 
both computation and control issues into basic FEM. The clear identification of the boundary element set provides 
insight into these relationships. 
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Figure 2. Strip mapping of (a) a finite element domain and (b) the corresponding 
K matrix onto a 2-d hypercube 
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2.3 Solving the FEM Equations 


Methods for solving such equations on sequential computers can be grouped as: direct methods 
such as Gaussian elimination, LU decomposition and Cholesky factorization and iterative methods 
such as Gauss-Jacobi, Gauss-Seidel and Conjugate Gradient methods. Although extensive research 
has been done on parallelization of the solution of large sparse systems of linear equations, new 
architectural features such as the crossbar connection capability and massively parallel distributed 
memory architectures with fast communication offer the potential for new approaches to algorithm 
design. 

Among the iterative methods, the CG algorithm [3], is increasingly being used in sparse matrix 
solvers, since it converges in at most n steps. The computational steps of the standard CG algorithm 
are given below: 

Let b = BF. Initially, choose x 0 and compute r 0 = Po = b - fCx 0 . Then, for k = 0, 1, 2, . . . 


1 . 

2 . 

3. 

4. 

5. 

6 . 


form qj, = Kp k 

a. form < p k , q k > 

b. a k = 

Tfc+1 - T k ~ Otfcq* 
x fc+l = x fc + Ofcpfe 

a. form < r*+i,r* + i > 

b. 0, - 

Hk <*k*k> 

Pfc+l = r fc+l + 0kPk 


(2.3) 


Here, r k is the residual error associated with the trial vector x. k , i.e. r k ' = b - Kx k which must be 
null when x k is coincident with x which is the solution vector and is the direction vector at the 
k-th iteration. As seen from (1), the CG algorithm has three types of operations: matrix vector 
product K p*, inner products < rk+i,r*+i > and < Pk,qfc > and the vector additions required in 
steps 3,4, and 6. To perform these operations concurrently, the rows of K, and the corresponding 
elements of the vectors b, x, r, q suid p must be distributed among the processors by considering 
the communication features of the machine. Details of the implementation can be found in [3] where 
a speedup of over 15 is reported for a 16-node hypercube implementation. 


2.4 Modal Analysis 

Although much research on linear equation solutions for banded matrices has been done, results 
on eigenvalue calculations on hypercubes for banded matrices are very recent. We shall not dwell 
further on this issue except to point out that such an approach will have to be pursued to retain the 
advantages of the mapping described above. 
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3* INCLUSION OF ACTUATORS 


3.1 The Proof-Mass Actuator Model 


It is clear that the addition of an actuator with no bi-directional coupling with structure dynamics 
is simple and will only affect the mass matrix in the FEM. Thus the number of nodes in the FEM 
will not change. Some of them will have to be tagged in order to do a parametrized study of actuator 
location variation. On the other hand, actuators with dynamics coupled to the structure dynamics 
are somewhat more complex. In what follows, we consider such a case, specifically the model of a 
generic proof-mass actuator. (This follows the same lines as the analysis of a specific proof-mass 
actuator on a simple beam, presented in [8].) 

Consider a proof-mass actuatoT attached to a flexible structure at a certain point. We shall simplify 
the problem somewhat by assuming that the point of attachment has been represented by a single 
node in the FEM. Let the nodal position variable (in the same direction as proof-mass movement) 
at this point be denoted by Let the proof-mass have mass m, friction constant d m and spring 
constant Jk m . 1 The equations of motion of the mass m are given by, 


77TZ "I" d m (i <}t) *4“ k m {z Qi) — ft 


(3.1) 


where z denotes the position of the mass with respect to a fixed reference frame and f t is the force 
generated by the windings and can be modeled as a linear DC motor as follows, 

ft = k e i (3.2) 

e = Ri+k b {z-qi) , (3.3) 

where Jfc e , k b are the motor constant and back e.m.f. constant respectively, R is the electrical resistence 
in the the motor circuit, and t and e denote the current and input voltage to the motor. Let us 
define the following constants: 


d, = ^ (3.4) 

b = | (3.5) 

Thus the total dynamics of the proof-mass actuator can be given as, 


mz + (dm + d t )z + k m z - (d m + d e )qi - k m qi = be 


(3.6) 


The reaction force applied to the flexible structure is, 

F = mz = f e - d m (z - qi) - k m (z - qi) (3.7) 

*The last two could also have been added by electronic means, that is by wrapping a local feedback loop around the 
actuator. See [9] for such an example. 
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3*2 Embedding into the FEM 

The actuator that has been analyzed in the present paper is particularly suitable for spacially de- 
centralized control applications since it can be distributed over several locations of a given flexible 
structure for active vibration damping. Some attempts along these lines (using different devices) 
have already been reported in the literature [10,11]. To keep the notation simple, we shall continue 
treating the case with a single actuator. Consider now the dynamical equations representing the 
structure obtained from a FEM, with no actuator dynamics: 

Mq+Kq = BF (3.8) 

For our case B is a vector with all entries zero except the i’th which is 1. We define the expanded 
nodal position vector q as 


(3.9) 


This is essentially equivalent to assigning nodal variables to the proof-mass actuator and considering 
it as a mass-spring system. The back e.m.f* and other effects have to be treated with some care. 
The final result will be an expanded FEM that can be written as, 

Mq + Vq + JCq = Be (3.10) 

where the various matrices can be generated from the original (non-actuated) FEM and the actuator 
model. (An interesting and important observation here is that the structure will have damping with 
the input shorted, i.e. e = 0). 

The point of all the above discussion is: 

1, Actuator dynamics could have been introduced in the FEM stage. 

2. If there are choices in actuator location, all “possible” actuators can be inserted by the intro- 
duction of dummy nodal variables. 
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4. THE OPTIMAL CONTROLLER DESIGN 


4.1 A General Decentralized Structure 

A number of results have appeared in the literature on using a quadratic regulator framework for 
controller design for flexible structures. We shall assume here that a decentralized information 
structure has been imposed. This means that there exists a correspondence between the inputs 


(actuators) and the output measurements (sensors). Let the system be modelled as, 

V 

x = Ax + *»-«i ( 4 -l) 

i=l 

y i = CiX + DiUi ; i = 1, ( 4 - 2 ) 

where x € R" , u» € , y,- € R m * and the matrices are real and of compatible dimension. Due 

to the decentralization constraint, only static feedback is allowed, the control is 

u, = Kiyi ; <=1, •••,*» (4.3) 

or if dynamics are allowed in each feedback loop* the control is 

z, SB FiZi + Giyi (4.4) 

Ui = HiZi + Nw ; t = l,...,i/. (4.5) 

where z< G Ji** . 


4.2 The Decentralized Quadratic Regulator 


Consider a large scale system (4.1)-(4.2) with decentralized control (4.3) . The basic problem is to 
fin d an optimal static feedback gain so that the following cost function is minimized: 


J = 



+ 


+ ^ ufRim) dt 

i= 1 


(4.6) 


and the following feedback structure constraint: 
tii — K{ yi ; * = 1 , . . ., N. 


(4.7) 


It can be shown [12,13,5,14] that the necessary conditions for minimizing J given by (4.6) with the 
controller structure (4.7) imply the solution of the following system of nonlinear algebraic equations: 


/ A T e P + PA C + Q = 0 
\ A C L + LAJ + X o = 0 
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and 


V Ki J = BjPLCf + RiKiCiLCj = 0 
where 

N 

A e = A + ^B.K.C, 

i = 1 
N 

Q = Q + Y, C ? K ? R ' K ' C ' 

i=l 

Ao — *o *o • 

4.3 The Lyapunov Equation Solution 

It is obvious from the above that the solution of a single or coupled multiple Lyapunov equations, 

AX + XA t + Q = 0 (4.8) 

is key to many optimal controller design problems. Indeed the Lyapunov equation has been analyzed 
numerous times and many solution algorithms proposed. In general Schur decomposition based 
algorithms have been the accepted, reliable methods of solving small size problems, where A is dense 
and has no particular structure. Gardiner and Laub have addressed [4] the issue of large size and 
hypercube implementation, however the equations they have considered are still dense.Consideration 
of the sparse case is very recent, and general heuristic algorithms have been proposed in [15] with 
no convergence proof. 

In considering linear equation solutions and m odal a nalysis, we have discussed the case of banded 
matrices arising from the FEM. We have to, however, analyze clearly what the structure of the A 


matrix in the Lyapunov equation will be if it ar ises fr om a FEM. 


Consider the state equations that one can obtain from the original model, 

Mq + Vq + Kq = BF (4.9) 

If we go into modal form from the nodal form, with the unitary transformation, 
z = $ T q (4.10) 

to obtain, 

z + Dz + Kz = BF (4.11) 

where the relevant matrices are, 

D = * T Vi (4.12) 

H = (4.13) 

5 = (4.14) 

we will obtain, in state space, the A matrix as, 
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(4.15) 


A = 


0 

-k 


i 

-D 


The required Lyapunov equations can then be solved in terms of the A matrix above. Note, how- 
ever, that the problem is in modal space and the distribution to processors, accomplished during K 
generation is no longer valid. Besides, calculation of the nodal-modal transformations have to be 
accomplished. 

Another approach which holds some promise is, remaining in nodal space and transforming the 
relevant control design equations. Thus, the equation of interest will no longer be the standard 
Lyapunov equation, but a transformed Lyapunov equation appropriate for systems modeled by second 
order differential equations. This provides the benefit of retaining the banded structure in the 
coefficient matrices without going through a modal transformation stage. 


5* CONCLUSION 


, In this paper we considered a possible design package for large flexible mechanical structures that 
would address CSI problems. We advocate the exploitation of the banded structure that can be 
obtained from proper mappings in the FEM stage. We furthermore advocate the utilisation of 
distributed memory multiprocessors in a hypercube topology as particularly suitable for addressing 
the algorithmic problems. 

Research in this area is somewhat new, but draws on aspects of previous work, in FEM, in matrix 
algorithms and in control design. The key issues are related to 

• Being able to cast multiple problems into a single framework 

• Retaining the same framework without doing unnecessary transformations 

• Introducing dummy variables and parameters which can be used in parametric studies 

Samples of subproblems and cases have been given to indicate the necessity of further work along 
these lines. 
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